|
|
Registro Completo |
Biblioteca(s): |
Embrapa Pecuária Sudeste; Embrapa Pesca e Aquicultura. |
Data corrente: |
09/07/2019 |
Data da última atualização: |
08/01/2020 |
Tipo da produção científica: |
Artigo em Periódico Indexado |
Autoria: |
FERREIRA FILHO, D.; BUENO FILHO, J. S. de S.; REGITANO, L. C. de A.; ALENCAR, M. M. de; ALVES, R. R.; BAENA, M. M.; MEIRELLES, S. L. C. |
Afiliação: |
Diógenes Ferreira Filho, UFRRJ; Júio Sílvio de Sousa Bueno Filho, UFLA; LUCIANA CORREIA DE ALMEIDA REGITANO, CPPSE; MAURICIO MELLO DE ALENCAR, CPPSE; ROSIANA RODRIGUES ALVES, CNPASA; Marielle Moura Baena, UFLA; Sarah Laguna Conceição Meirelles, UFLA. |
Título: |
Tournaments between markers as a strategy to enhance genomic predictions. |
Ano de publicação: |
2019 |
Fonte/Imprenta: |
Plos One, v. 14, n. 7, e0219448, p. 1-17, 2019. |
DOI: |
10.1371/journal.pone.0219448 |
Idioma: |
Inglês |
Conteúdo: |
Analysis of a large number of markers is crucial in both genome-wide association studies (GWAS) and genome-wide selection (GWS). However there are two methodological issues that restrict statistical analysis: high dimensionality (p>>n) and multicollinearity. Although there are methodologies that can be used to fit models for data with high dimensionality (eg,the Bayesian Lasso), a big problem that can occurs in this cases is that the predictive ability of the model should perform well for the individuals used to fit the model, but should not perform well for other individuals, restricting the applicability of the model. This problem can be circumvent by applying some selection methodology to reduce the number of markers (but keeping the markers associated with the phenotypic trait) before adjusting a model to predict GBVs. We revisit a tournament-based strategy between marker samples, where each sample has good statistical properties for estimation: n>p and low collinearity. Such tournaments are elaborated using multiple linear regression to eliminate markers. This method is adapted from previous works found in the literature. We used simulated data as well as real data derived from a study with SNPs in beef cattle. Tournament strategies not only circumvent the p>>n issue, but also minimize spurious associations. For real data, when we selected a few more than 20 markers, we obtained correlations greater than 0.70 between predicted Genomic Breeding Values (GBVs) and phenotypes in validation groups of a cross-validation scheme; and when we selected a larger number of markers (more than 100), the correlations exceeded 0.90, showing the efficiency in identifying relevant SNPs (or segregations) for both GWAS and GWS. In the simulation study, we obtained similar results. MenosAnalysis of a large number of markers is crucial in both genome-wide association studies (GWAS) and genome-wide selection (GWS). However there are two methodological issues that restrict statistical analysis: high dimensionality (p>>n) and multicollinearity. Although there are methodologies that can be used to fit models for data with high dimensionality (eg,the Bayesian Lasso), a big problem that can occurs in this cases is that the predictive ability of the model should perform well for the individuals used to fit the model, but should not perform well for other individuals, restricting the applicability of the model. This problem can be circumvent by applying some selection methodology to reduce the number of markers (but keeping the markers associated with the phenotypic trait) before adjusting a model to predict GBVs. We revisit a tournament-based strategy between marker samples, where each sample has good statistical properties for estimation: n>p and low collinearity. Such tournaments are elaborated using multiple linear regression to eliminate markers. This method is adapted from previous works found in the literature. We used simulated data as well as real data derived from a study with SNPs in beef cattle. Tournament strategies not only circumvent the p>>n issue, but also minimize spurious associations. For real data, when we selected a few more than 20 markers, we obtained correlations greater than 0.70 between predicted Genomic Breeding Values (GBVs) and phenotyp... Mostrar Tudo |
Palavras-Chave: |
Genome-wide; Genomic Breeding Values; GWAS; GWS; SNPs. |
Thesagro: |
Genoma; Genótipo; Marcador Genético; Seleção Genética. |
Thesaurus Nal: |
Genetic markers; Genomics; Genotyping. |
Categoria do assunto: |
G Melhoramento Genético |
URL: |
https://ainfo.cnptia.embrapa.br/digital/bitstream/item/199471/1/Tournaments-between-markers-as-a-strategy-correcao.pdf
|
Marc: |
LEADER 02769naa a2200349 a 4500 001 2110534 005 2020-01-08 008 2019 bl uuuu u00u1 u #d 024 7 $a10.1371/journal.pone.0219448$2DOI 100 1 $aFERREIRA FILHO, D. 245 $aTournaments between markers as a strategy to enhance genomic predictions.$h[electronic resource] 260 $c2019 520 $aAnalysis of a large number of markers is crucial in both genome-wide association studies (GWAS) and genome-wide selection (GWS). However there are two methodological issues that restrict statistical analysis: high dimensionality (p>>n) and multicollinearity. Although there are methodologies that can be used to fit models for data with high dimensionality (eg,the Bayesian Lasso), a big problem that can occurs in this cases is that the predictive ability of the model should perform well for the individuals used to fit the model, but should not perform well for other individuals, restricting the applicability of the model. This problem can be circumvent by applying some selection methodology to reduce the number of markers (but keeping the markers associated with the phenotypic trait) before adjusting a model to predict GBVs. We revisit a tournament-based strategy between marker samples, where each sample has good statistical properties for estimation: n>p and low collinearity. Such tournaments are elaborated using multiple linear regression to eliminate markers. This method is adapted from previous works found in the literature. We used simulated data as well as real data derived from a study with SNPs in beef cattle. Tournament strategies not only circumvent the p>>n issue, but also minimize spurious associations. For real data, when we selected a few more than 20 markers, we obtained correlations greater than 0.70 between predicted Genomic Breeding Values (GBVs) and phenotypes in validation groups of a cross-validation scheme; and when we selected a larger number of markers (more than 100), the correlations exceeded 0.90, showing the efficiency in identifying relevant SNPs (or segregations) for both GWAS and GWS. In the simulation study, we obtained similar results. 650 $aGenetic markers 650 $aGenomics 650 $aGenotyping 650 $aGenoma 650 $aGenótipo 650 $aMarcador Genético 650 $aSeleção Genética 653 $aGenome-wide 653 $aGenomic Breeding Values 653 $aGWAS 653 $aGWS 653 $aSNPs 700 1 $aBUENO FILHO, J. S. de S. 700 1 $aREGITANO, L. C. de A. 700 1 $aALENCAR, M. M. de 700 1 $aALVES, R. R. 700 1 $aBAENA, M. M. 700 1 $aMEIRELLES, S. L. C. 773 $tPlos One$gv. 14, n. 7, e0219448, p. 1-17, 2019.
Download
Esconder MarcMostrar Marc Completo |
Registro original: |
Embrapa Pecuária Sudeste (CPPSE) |
|
Biblioteca |
ID |
Origem |
Tipo/Formato |
Classificação |
Cutter |
Registro |
Volume |
Status |
URL |
Voltar
|
|
Registros recuperados : 1 | |
1. | | MATOS, C. F.; PINHEIRO, E. F. M.; CAMPOS, D. V. B. de. Biogas and biofertilizer production from dairy cattle manure under organic and conventional production systems in Seropedica, Rio de Janeiro (Brazil). In: WORLD CONGRESS OF SOIL SCIENCE, 21., 2018, Rio de Janeiro. Soil science: beyond food and fuel: proceedings... Viçosa, MG: SBCS, 2019. v. 2, p. 150-151. WCSS 2018.Tipo: Resumo em Anais de Congresso |
Biblioteca(s): Embrapa Solos. |
| |
Registros recuperados : 1 | |
|
|
Registro completo
Biblioteca(s): |
Catálogo Coletivo de Periódicos Embrapa; Embrapa Florestas; Embrapa Meio Ambiente; Embrapa Meio-Norte; Embrapa Trigo. |
Identificador: |
69 |
Data corrente: |
09/05/2002 |
Data da última atualização: |
30/04/2015 |
Código do título: |
0900016 |
ISSN: |
0301-2123 |
Código CCN: |
000690-4 |
Título e Subtítulo: |
ACTA BIOLOGICA PARANAENSE |
Entidade: |
Instituto de Biologia da Universidade Federal do Parana |
Local de publicação: |
Curitiba, PR |
Periodicidade: |
Irregural |
Inicio de publicação: |
1972 |
Coleções da unidade: |
Embrapa Florestas 1972 1(1/2,3/4); 1975 4(1/2,3/4); 1976 5(3/4); 1983 12(1/4); 1985 14(1/4); 1986 15(1/4); 1987 16(1/4); 1988 17(1/4); 1989 18(1/4); 1990 19(1/4); 1991 20(1/4); 1992 21(1/4); 1994 23(1/4); 1995 24(1/4); 1996 25(1/4); 1997 26(1/4); 1998 27(1/4); 1999 28(1/4); 2000 29(1/4); 2001 30(1/4); 2002 31(1/4); 2003 32(1/4); 2004 33(1/4); 2005 34(1/4); 2006 35(1/2,3/4); 2007 36(1/2,3/4); 2008 37(1/2,3/4); 2009 38(1/2,3/4); 2010 39(1/2,3/4); 2011 40(1/2,3/4); 2012 41(1/2,3/4)
Embrapa Meio Ambiente 1972/2012 1(1/2); 4 (1/2,3/4); 5(1/2,3/4); 6(1/4); 8; 7(1/4); 8(1/4); 9(1/4); 10(1/4); 11(1/4); 12(1/4); 13(1/4); 14(1/4); 15(1/4); 16(1/4); 17(1/4); 18(1/4); 19(1/4); 20(1/4); 21(1/4); 22(1/4); 23(1/4); 24(1/4); 25(1/4); 26(1/4); 27(1/4); 28(1/4); 29(1/4); 30(1/4); 31(1/4); 32(1/4); 33(1/4); 34(1/4); 35(1/2, 3/4); 36(1/4); 37(1/4); 38(1/4); 39(1/4); 40(1/4); 41(1/4)
Embrapa Meio-Norte 1972 1(1/2); 1985-2008 14-38; 2011 40(1-4); 2012 41(1/4) Classificação: 574.05
Embrapa Trigo 1972/91 1 1972; 2 1973; 3 1974; 4 1975; 5 1976; 6 1977; 7 1978; 8 1979; 9 1980; 10 1981; 11 1982; 12 1983; 13 1984; 14 1985; 15 1986; 16 1987; 18 1989; 19 1990; 20 1991; 28(1-4) 1999. Classificação: 574.05 |
|
Fechar
|
|
|